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1. Summary 



MuRFI is an E3 ubiquitin ligase central to muscle catabolism. It belongs to the 
TRIM protein family characterized by a tripartite fold of RING, B-box and 
coiled-coil (CC) motifs, followed by variable C-terminal domains. The CC 
motif is hypothesized to be responsible for domain organization in the fold as 
well as for high-order assembly into functional entities. But data on CC from 
this family that can clarify the structural significance of this motif are scarce. 
We have characterized the helical region from MuRFI and show that, contrary 
to expectations, its CC domain assembles unproductively, being the B2- and 
COS-boxes in the fold (respectively flanking the CC) that promote a native qua- 
ternary structure. In particular, the C-terminal COS-box seemingly forms an 
a-hairpin that packs against the CC, influencing its dimerization. This shows 
that a C-terminal variable domain can be tightly integrated within the conserved 
TRIM fold to modulate its structure and function. Furthermore, data from trans- 
fected muscle show that in MuRFI the COS-box mediates the in vivo targeting of 
sarcoskeletal structures and points to the pharmacological relevance of the COS 
domain for treating MuRFI -mediated muscle atrophy. 
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Figure 1. Schematic domain composition of MuRFI. MFC refers to an MuRF family specific motif, and AT denotes a C-terminal acidic tail, which is predicted to be 
unstructured. Constructs used in this study are indicated. 



2. Background 

The regulation of protein catabolism by the proteasome 
system receives ever-increasing attention owing to its 
impact on the (patho)physiology of the eukaryotic cell. The 
TRIM protein family consists of E2 and E3 ubiquitylating pro- 
teins that arbitrate cellular processes, such as growth and 
differentiation, transcription, apoptosis and viral response 
[1,2]. Consequently, TRIMs have been linked to multiple 
pathologies, including cancer, familiar Mediterranean fever, 
Opitz/BBB syndrome, mulibrey nanism, thyroid carcinomas 
and myopathy [3-6]. Despite their functional diversity, 
TRIM proteins invariably share a tripartite fold consisting 
of a RING finger (R), one or two RING finger-like B-box 
domains (B) and a helical segment predicted to form a 
coiled-coil (CC) motif. This RBCC fold constitutes the con- 
stant N-terminal fraction of TRIM proteins, but variable 
domains can be found in C-terminal position (e.g. PHD, 
COS-box, PRY-SPRY). Those specific domains define the 
classification of the more than 70 members of the TRIM 
family into nine distinct classes (CI-CIX, where C signifies 
C-terminal subgroup) [7,8]. 

The function of the TRIM /RBCC fold is to serve as a scaf- 
fold that induces homo- and heteromeric interactions across 
diverse E2-E3 ubiquity lation systems, leading to their for- 
mation of pleiotropic complexes in the cell [2,9,10]. The CC 
domain is thought to be central to this function by contribut- 
ing to position domains within the TRIM fold, promoting 
high-order assembly and mediating molecular targeting. 
However, CC domains from TRIMs have atypical sequences 
with poorly defined heptad-repeat compositions that confer 
on them complex associative properties and a pronounced 
tendency to aggregate. This hinders their characterization at 
the molecular level and, as a result, little understanding 
exists of their self-assembly process. 

Muscle-specific RING fingers proteins (MuRFs) are E3 ubi- 
quitin ligases that associate with the sarcomeric cytoskeleton 
reportedly through their CC domains [11]. MuRFs form the 
C-II TRIM class [7], whose C-terminal specific fraction contains 
a COS (C-terminal subgroup One Signature)-box motif and an 
intrinsically disordered acidic tail (figure 1). There are three 
known members of the MuRF family— MuRFl [12], MuRF2 
[13] and MuRF3 [14] — all involved in controlling the trophicity 
of striated muscle tissue. The three MuRFs are encoded by 
different genes but are remarkably conserved: approximately 
81% sequence identity across their RB fractions and approxi- 
mately 36% in their CC domains [6]. MuRFl is the 
best-studied member of the family. It is strongly upregulated 
by atrophic stimuli, such as immobilization, denervation. 



nutritional deprivation, ageing and disease (e.g. cancer, 
sepsis and renal failure) [15-17]. MuRFl targets components 
of the contractile sarcoskeleton; namely, myosin [18,19], tropo- 
nin-T and titin [12,20,21]. Thus, it is regarded as the critical E3 
ligase that acts on the cytoskeleton in situ, contributing to the 
disassembly of the myofibril. MuRFl also appears to have sig- 
nalling roles in the cell as it interacts with a broad range of 
cellular factors, including ubiquitin carboxyl-terminal hydro- 
lase 13 (USP13), the SUMO E2 ligase Ubc9 and the 
transcription regulator GMEB-1 [22]. MuRFl deletion attenu- 
ates muscle wasting and it is a pursued pharmacological 
target [15,23]. Despite this, the structural and functional differ- 
ences between MuRFs are poorly understood, as is the balance 
of their expression across muscle types and during develop- 
ment [24,25]. MuRF2 and MuRF3 are not transcriptionally 
upregulated by atrophic stimuli, but they act synergistically 
with MuRFl (e.g. myosin is co-degraded by MuRFl and 
MuRF3 [26], while MuRFl and MuRF2 jointly modulate cardiac 
h3rpertrophy by acting on CARP/EEFIG [21]). Such functional 
coupling might reflect the formation of MuRF hetero-oHgomers 
in the cell, as is characteristic of TRIM proteins. 

The molecular understanding of MuRFl targeting is 
scarce, with only the interaction with the titin myofilament 
having been studied in vitro [11,12]. Using recombinant 
samples, we showed that MuRFl binds M-line titin with 
high affinity through its helical domain (HD) [11]. The binding 
site in titin is formed by a tandem of Ig-Ig-Fn domains (A168- 
A170) just N-terminal to titin kinase. We previously character- 
ized this tandem structurally at atomic level and identified the 
determinants of its binding [11]. By contrast, little is known 
about the structure of MuRFl and its CC scaffold, which is 
central to molecular targeting. Here, we study the full-length 
HD of MuRFl, spanning its at3rpical CC sequence and the 
COS-box flanking domain. Our data reveal the interrelation 
of these motifs both structurally and functionally within the 
TRIM fold of MuRFl, and, in particular, the high significance 
of the COS-box in sarcomere targeting. 

3. Results 

3.1. The helical domain of MuRFl has low propensity 
to coiled-coil formation 

To estimate the associative properties of the HDs of MuRFs, 
we predicted their potential for CC formation. CC motifs con- 
sist of two to five amphipathic a-helices that wind around 
each other to form, typically, a left-handed supercoil, thereby 
inducing protein oligomerization [27]. CC sequences are 
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Figure 2. Sequence analysis of the HD of MuRFs. (a) Sequence alignment of known MuRF sequences (MuRFl, MuRF2, MuRF3; h = human, m = mouse, r = rat, 
b = bovine, p = orangutan). The unstructured acidic tail is excluded. The colour code reflects sequence conservation as identity percentile: dark blue > 80%, light 
blue > 60%, light grey > 40%, white < 40%. (b) Secondary structure prediction of the HD fraction (lower panel in (a)) of human MuRFl (grey), MuRF2 (cyan) 
and MuRF3 (red). Cylinders indicate helices, (c) Prediction of CC regions in human MuRFs (colour code as above). The total probability for CC formation is shown as a 
solid line, with the probability for dimeric CC assembly indicated by a dashed line. Boxed in yellow is the sequence identified here as the CC-compatible segment of 
MuRF proteins. Strictly conserved residues are shown in the x-axis. 



characterized by a heptad-repeat of amino acids denoted by a 
to g. Positions a and d classically harbour hydrophobic 
residues, which constitute the structural core of the motif. 
Positions e and g often host charged groups that form intra- 
and inter-helical salt bridges crucial to fold stability and 
interchain registry. An initial prediction of secondary struc- 
ture content indicated that the helical region of MuRFs 
consists of a long, uninterrupted helix (helix HI), followed 
by two short C-terminal helices (helix H2 and H3) linked 
by loops and mapping to the COS-box motif (figure 2). 
This prediction is consistent with a previous study that esti- 
mated the secondary structure content of the helical region 
of MuRFl to be 70% a-helix and 30% random coil, based 
on circular dichroism data [11]. Predictors indicated that 
only the C-terminal end of the long helix HI is compatible 
with CC formation. The prediction was consistent for all 
three MuRFs, although MuRF2 had the shortest predicted 
CC-segment owing to the presence of bulky residues (M227 
and F266) in core heptad positions a and d. Such groups are 
poorly accommodated in the limited interface of coiling a-heHces 
[28]. The tendency for CC formation was modest for all MuRFs, 
as reflected by the low probability scores (figure 2c). The scores, 
in addition, could not resolve a preference for dimeric or trimeric 
association as both states yielded comparable probability 



values. However, previous studies that used size exclusion 
chromatography coupled to multi-angle laser light scattering 
(SEC-MALLS) on the helical fraction of MuRFl showed it to 
form dimers [11,29]. In the light of these data, the predictions 
led us to anticipate that the CC-prone, C-terminal end of helbc 
HI must form dimeric CC motifs, thereby being a molecular 
determinant of self-assembly in MuRFs. 

3.2. Crystal structure of the MuRFl CC segment reveals 
a tetrameric palindrome 

To gain an insight into the assembly of the CC-prone segment 
identified, we elucidated the crystal structure of the corre- 
sponding fragment from human MuRFl (MuRFl^^) to 
2.1 A resolution (table 1). The crystals contained four copies 
of the MuRFl chain in their asymmetric unit. These 
assembled into two parallel dimers, each having an 'open scis- 
sor' conformation that intercalated through their C-terrrdnal 
ends to form a palindromic, inverted tetramer (figure 3). 
This packing resembles that of the CC domains from the post- 
synaptic density protein Homer (PDB code 3CVE), Ndell 
(2V66) and BST2/tetherin (3MKX), although sequence 
similarity between these and MuRFl^^ is not detectable. In 



Table 1. Diffraction data statistics and model refinement parameters. 



space group 


P2i 




unit cell dimensions 


a = 70.79 A, b = 


24.41 A, 




c = 75.39 A, /3 


= 107.65° 


X-ray data 


beamline 


103 (diamond) 




detector 


ADSC Q315r 




wavelength (A) 


0.97 




resolution (A) 


20.00-2.10 (2.15- 


2.10) 


no. unique reflections 


14 626 (1008) 




Rsym (1) 


2.7 (45.0) 




multiplicity 


3.64 (3.74) 




completeness (%) 


97.7 (99.3) 




\I(T (!) 


17.24 (3.31) 




model refinement 


no. reflections in working/ 


13 899/725 




free set 






no. protein residues 


228' 




no. solvent molecules/buffer 


55/20^ 




molecules 






R-factor/R-free (%) 


21.18/26.15 




RMSD bond length (A)/bond 


0.006/0.836 




angle (°) 






Ramachandran statistics 


favoured/allowed/outliers (%) 


99.07/0.46/0.47 





'Out of a total of 244 amino acids, 16 residues were structurally disordered 
and are missing from the model (corresponding to 6.5% of the structure). 
The missing residues are as follows: chain A (G271), chain C (G-3, E269, 
P270, G271), chain B (E269, P270, G271), chain D (G-3, A-2, M-1, D214, 
D268, E269, P270, G271). 

^Ordered buffer components are glycerol and acetic acid. 



MuRFl, however, the chains within each dimer are out of 
registry by one full heptad-repeat and their crossing angle 
is broad, leading to the unpairing of both N- and C-termini. 
The resulting exposure of hydrophobic residues at the 
C-terminus permits the formation of an interchain hydro- 
phobic core that supports the assembly of the inverted 
tetramer (inset in figure 3). Other than that core, the contacts 
within each dimer are few, with only two salt bridges (K224- 
E236' and E236-K238') providing chain recognition (analysis 
of chain interfaces used PISA [31]). One additional salt 
bridge (K238-D268''; chains DA in figure 3a) is present in 
the structure, but it contributes to tetramer stabilization. 

A further analysis with TWISTER [32] (figure 3h) and 
SOCKET [28] (electronic supplementary material, figure SI) 
was used to identify stretches in each chain that formed 
parallel, antiparallel or simultaneous parallel /antiparallel 
interactions. The analysis revealed that MuRFl^^ can establish 
manifold interactions, being able to support concurrently the 
formation of parallel and antiparallel dimers and tetrameric 
helical bundles. These promiscuous self-associative interactions 
are in agreement with the mixed probability scores of the CC 
predictions. This led us to question the prevailing view that 



this MuRFl region associates into robust CC motifs and that 
it directs the productive quaternary assembly of MuRFl. 

3.3. Coiled-coil fraction of MuRFl does not assemble 
productively in solution 

Given the unexpected assembly of MuRFl^^ in the crystalline 
state, we studied its association in solution using SEC-MALLS. 
This technique yields an accurate determination of molecular 
mass (MM) without being influenced by molecular shape, an 
important consideration when dealing with strongly aniso- 
metric molecules. SEC-MALLS measurements yielded an 
average MM of 11.8 kDa (table 2; electronic supplementary 
material, figure S3a). This value is intermediate between the 
calculated MM of a monomer (7 kDa) and a dimer (14 kDa) 
of this sample. This indicates that the tetrameric crystalline 
state does not predominate in solution, where the sample 
appears to form mostly dimers, probably in equilibrium with 
a monomeric fraction. 

To investigate whether the extraction of MuRFl^^ from 
its molecular context could have weakened its self-associ- 
ation, we assayed next an MuRFl construct spanning the 
full-length of helix HI (MuRFl^^; figure 1), which probably 
constitutes the entire CC motif. The expectation was that 
this sample would show a stable, canonical, dimeric associ- 
ation. However, SEC-MALLS data revealed that the sample 
is trimeric and /or tetrameric in solution (table 2; electronic 
supplementary material, figure S3b). The assemblies in sol- 
ution may reproduce the interactions observed in the 
crystal structure of MuRFl but these might now be stabil- 
ized by the longer length of the interacting chain. It can 
be concluded that the long helix HI of MuRFl does not 
assemble into native CC dimers. 

3.4. Coiled-coil flanking motifs modulate the formation 
of MuRFl rod-shaped dimers 

The irregularities in MuRFl^^ association led us to investigate 
the role of CC flanking motifs in assembly. Using SEC-MALLS, 
we analysed the oHgomeric state of an MuRFl construct 
comprising the full-length helbc HI, the preceding B2 box 
and the subsequent COS-box (MuRFl^^'^*^'*^^^; figure 1). The 
data (table 2; electronic supplementary material, figure S3c) 
showed that the sample forms a range of oligomeric species, 
including high-order aggregates, but that a sizeable fraction 
forms small assemblies with an average MM of 49.2 kDa. 
This value is in excellent agreement with the theoretical MM 
of 48.1 kDa for a dimer of this construct, confirming that the 
dimer (and not the tetramer) is the ground association state 
of this sample. 

To explore the global conformational features of 
MuRFl^^'^^'^^^, we imaged the dimeric population fraction 
using electron microscopy on negatively stained samples 
(figure 4ia,h). Micrographs showed a rod-like morphology of 
approximately 17 + 3 nm length and 2.6 + 0.36 nm cross- 
section (n = 614). This overall shape suggested that, as 
expected, the HD forms an elongated shaft with the B2-box 
in apical position. However, the molecular length was shorter 
than anticipated and could be explained by just the length of 
helix HI (29 helical turns with a pitch of 5.4 A approximates 
15.6 nm) plus the B2-box (approx. 2 nm). This led us to infer 
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Figure 3. (a) Crystal structure of MuRFI^^, where helices have been coloured to indicate: no interchain assembly (grey), parallel interaction (beige), antiparallel 
interaction (yellow), simultaneous parallel and antiparallel interaction (orange). Salt bridges are labelled. The inset (central panel) shows the hydrophobic core 
formation of the tetrameric region, (b) Twister analysis of each MuRFI^^ half. The structural point to which the values correspond is indicated by the sequence 
(in dimeric or tetrameric state) displayed in thex-axis. Solid lines indicate pitch and dashed lines radius. The pitch and radius vary along the length of the molecule, 
but the values of the central region of each dimeric half are close to those of a canonical dimeric CC, where pitch approximates 150 A and radius 5 A [30]. The 
analysis of knobs-into-hole packing of each dimer using SOCKET [29] indicated that 18 residues in the AC dimer exhibited a conventional CC packing (7,7,4 repeat), 
while the BD dimer had 25 residues in CC arrangement (7,7,7,4 repeat) (electronic supplementary material, figure SI). In the sequence that follows from this, 
TWISTER shows the opening of the coil and a resulting helical phase transition (between residues 252-QL-253 in chains A and D, respectively, partnered to 
chains C and B at motif 259-LV-260; highlighted by a red box) that defines the switch from a dimeric to a tetrameric association. 



that the two short C-terminal helices (i.e. the COS-box) must 
fold against the B2-CC fraction. 

3.5. The C-terminal COS-box of MuRFI is predicted to 
have a spectrin-like fold 

Next, we explored the fold of the COS-box through ah initio 
modelling. This motif has no homology with any protein of 
known structure, preventing the application of comparative 
modelling. In ah initio modelling, the three-dimensional struc- 
ture of proteins is derived from their amino acid sequence by 
stitching together suitable protein fragments using simulated 
annealing. The method is particularly successful when applied 
to all-a proteins [33] owing to the greater accuracy of their 
secondary structure prediction and the relatively limited 
modes of helical packing compared with the variable twists 
of p-sheets. Here, we employed the two leading ah initio 
modelling programs: Quark [34] and Rosetta [35]. Quark 
assembles fragments of variable length identified by fold recog- 
nition methods. Only available as a server, it returns a set of ten 
predictions and estimates of model reliability as TM-scores. 



Rosetta assembles fragments of 3- and 9-residue length ident- 
ified using PSI-BLAST. At the fragment assembly stage, 
numerous models are clustered by structural similarity and 
centroid representatives of large top clusters considered as can- 
didate fold predictions. The appearance of a large top cluster is 
generally indicative of accurate fold predictions. 

First, we modelled the sequence spanning the crystallo- 
graphic MuRFl^^ and the COS-box, as EM data suggested 
that the COS-box might require the preceding MuRFl^^ por- 
tion for packing. Models calculated using Quark (figure 5a) 
were in excellent agreement with the crystal structure in pre- 
dicting MuRFl^^ as a long a-helical shaft. They displayed the 
C-terminal COS-box as a compact arrangement, where two 
short helices folded into an a-hairpin that packed against 
the shaft fraction. The resulting three-helix bundle resembled 
a minimal version of the spectrin fold, where two helices lie 
parallel to each other and the third is a cross-connector [36]. 
Nine out of the 10 predictions (electronic supplementary 
material, figure S2) returned by Quark shared this same 
broad fold and had predicted TM-scores of 0.51-0.45 
(over the threshold of 0.3 that indicates statistical signifi- 
cance). Ah initio modelling in Rosetta using the standard 



Table 2. SEC-MALLS measurements. MM calculated from sequence data 
(MMcaic) is quoted for the monomeric chain, with the value for a dimer given 
in brackets. Experimental MM values (MMgxp) have been measured using 
SEC-MALLS (data shown in the electronic supplementary material, figure S3). 





^^214-271 


H,,69 - 263 


B2|CC|C0S"'-"' 




MM,aic (kDa) 


7 (14) 


11.3 (22.6) 


24 (48.1) 


17.4 (34.7) 


MMexp (kDa) 


11.8 


37.4 


49.2 


32.3 



^Given here for comparison, values reported in [28] for the full-length HD 
of MuRFI spanning HI and COS regions. 



protocol was unsuccessful, with the program overriding long 
helices suggested by secondary structure predictions to pro- 
duce compact, globular folds. This is a known limitation of 
RosETTA when handling anisometric structures [33]. Thus, 
we next provided Rosetta with the crystal structure of 
MuRFl^^ as a fixed fraction. This resulted in the largest clus- 
ter containing 150 of the total 1000 models, indicative of a 
satisfactory result (figure 5h). The model shared the same 
topology with Quark models (electronic supplementary 
material, figure S2d), this consistency being a further 
indication of reliable modelling. 

We then performed ab initio modelling in Quark of the full- 
length HD of MuRFl, comprising helix HI and COS-box 
(MuRFl^^). The models (figure 5c) represented the N-terminal 
fraction of the domain as a single uninterrupted helix and were 
consistent with those of MuRFl^^ plus COS-box described 
above. This result is in excellent agreement with secondary 
structure predictions (figure 2) and EM data. 

Interestingly, a comparison of ah initio models with the 
crystal structure of MuRFl^^ revealed a remarkable simi- 
larity in helix packing (electronic supplementary material, 
figure S4). In the MuRFl^^ tetramer, helices arrange them- 
selves along the same interfaces as those occupied by the 
COS-box a-hairpin in the ab initio models. This suggested 
that the crystallographic arrangement was a compensatory 
conformation aimed to satisfy naturalistic interfaces. 

Finally, we sought validation of the ah initio models by 
testing the interaction of independently produced samples 
of MuRFl^^ and the COS-box a-hairpin (MuRFl^^^). Co-seg- 
regation of both samples in size-exclusion chromatography 
(figure 4c; electronic supplementary material, figure S5) 
confirmed the interaction of the two segments. Our earlier 
work [11,29] showed the HD of MuRFl (spanning HI and 
COS-box) to be dimeric, similar to MuRFl ^^'^^'^^^ in this 
work. However, individually MuRF^^ forms higher assem- 
blies (table 2). This led us to conclude that the interaction of 
HI and COS-box prevents the non-native association of the 
HD, and thus that the COS-box is required to achieve 
productive homodimerization. 

3.6. In vivo expressed COS-box targets sarcomeric 
structures similar to full-length MuRFl 

We tested whether the COS-box contributes to sorting MuRFl to 
its in vivo locations. For this, we first confirmed the localization of 
endogenous MuRFl by iramunostairung (figure 6a-c). This 
detected MuRFl mostly in the Z-disc (consistent with its 



interaction with Z-disc proteins [20,21]) and also, more discre- 
tely, in the M-line region (consistent with its binding to titin 
A168-170 [11,12]) (figure 6a-c). Endogenous MuRFl is also 
known to localize to the neuromuscular junction [37]. Then, 
we transfected the tibialis anterior (TA) muscle of adult mice 
with a COS-GFP fragment. In vivo imaging of the overexpressed 
COS-box showed a regular pattern of striations in the myofibril, 
indicative of its targeting of defined sarcomeric structures 
(figure 6d,e). In addition, GFP fluorescence was also present in 
punctate structures co-locaHzing with endocytic acetylcholine 
receptor at the neuromuscular junction (figure 6/,g). To deter- 
mine the precise localization of COS-GFP in the sarcomeric 
striations, sections of the imaged muscles were prepared and 
stained against f-actin using the marker phalloidin-TRTTC 
(figure 6/z). This assigned the predominant in vivo targeting of 
COS-GFP to the Z-Hne/I-band region. The fainter binding at 
the M-line was no longer detectable, probably having been 
disrupted by the fixation procedure. On the whole, these data 
indicate that the localization of COS-GFP is consistent with 
that of endogenous MuRFl. Furthermore, the findings comp- 
lement a recent study on a pathogenic mutation of MuRFl, 
Q247'^, linked to hypertrophic cardiomyopathy [38]. The 
mutation results in a truncated protein that lacks the COS-box. 
Truncated MuRFl remains diffuse in the cytoplasm, no longer 
targeting sarcomeric structures and with a near-total loss of 
ubiquitinating function. It can be concluded that the COS-box 
is an important mediator of MuRFl interactions in vivo, and 
that it is necessary and sufficient for the recruitment of MuRFl 
to the sarcomeric cytoskeleton. 

4. Discussion 

CC domains are thought to drive the organization of TRIM 
proteins into functional assemblies. However, our analysis 
of the CC from MuRFl (MuRFl^^ and MuRFl^^) suggests 
that this domain does not govern molecular order in this 
TRIM but that flanking domains modulate its associative 
function. It is not rare that CC domains extracted from their 
protein context no longer assemble natively. For example, 
the coil 2 from lamin A forms parallel dimers in its intermedi- 
ate filament context, but antiparallel dimers in isolation [39]. 
To compensate for the promiscuity of the CC fold, many 
CC-containing proteins have additional motifs that condition 
the self-assembly of these domains. An example is the 
dimeric dystrophia myotonica protein kinase (DMPK), 
whose C-terminal CC domain forms robust, but artefactual, 
trimers in isolation [40]. DMPK assembles into functional 
homodimers by means of an N-terminal association motif 
that dictates the subsequent interaction of the CC fraction 
[41]. In MuRFl, the B2- and COS-boxes flanking helix HI 
act as terminal clamps that secure the correct self-assembly 
of the CC. Our previous structural characterization of the 
MuRFl B2-box showed that it forms homodimers in solution 
with high affinity [29]. Studies on other TRIM proteins have 
confirmed that B-boxes also form homodimers in those cases 
[42]. Thus, our data explain the need for the B2-box to pre- 
cede and pre-define the chain registry of the CC domain so 
as to initiate its productive assembly by vicinal confinement. 
This provides a rationale for the universal presence in the 
TRIM fold of the B2 box immediately N-terminal to the CC 
motif, forming an evolutionarily conserved core unit. Here, 
we also show that at the C-terminus, CC and COS-box 
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Figure 4. Characterization of MuRF1^^'^^'^°l [a) Electron micrograph of negatively stained MuRF1^^'^^'^°^ samples corresponding to the outermost tail fractions of size- 
exclusion chromatograms containing dimeric assemblies (electronic supplementary material, figure S3), (b) Gallery of three major class averages obtained by using the 
processing software EMAN1. (c) Complexation of MuRFl"^ and MuRF1^°^ samples monitored by size-exclusion chromatography. The complex was formed by mixing the 
samples in a molar ratio of 1 : 2.5 in 20 mM Tris-HCI pH 7.5, 200 mM NaCI, 1 mM DTT followed by incubation for 1 h at 4°C. The mixture was run on a Superdex 200 
HiLoad 26/60 column. Chromatogram and associated SDS-PAGE are shown. MW marker is SeeBlue Plus2 Pre-stained standard (Invitrogen) (samples are proximal to the 
6 kDa band), (cf) Proposed quaternary structure of MuRFI compiling known and predicted structural information on MuRF1. The structure of the B2-box dimer is that 
previously elucidated by X-ray crystallography (PDB 3DDT) [29]; the model for the HD and its dimeric assembly is as deducted in the current study. 



sequences are integrated into a structural unit, together form- 
ing an HD, where the COS-box might prevent fraying of the 
CC and aid the formation of functional dimers. This is the 
first example of how a specific C-terminal domain is inte- 
grated within the conserved TRIM fold to modulate its 
structure and function. We anticipate that this architectural 
design is shared by all 10 TRIMs in classes I- III where the 
CC is immediately followed by a COS-box [5]. In addition 
to the close MuRFl homologues MuRF2 and MuRF3, these 
classes include proteins such as the Opitz syndrome protein 
Midline-1 (MIDI) and its homologue Midline-2 (MID2), the 
brain-specific TRIM9 that is seemingly linked to neuronal 
dysfunction in dementia, and Harprin (TRIM36), thought to 
regulate the acrosome reaction in sperm during fertilization. 

We predict that the COS-box has a minimalistic spectrin- 
like fold. The spectrin fold has been particularly observed in 
microtubule-associated proteins [36], consistent with the role 
attributed to the COS-box [7,14]. The mutation of the con- 
served motifs FLQ and LDY in the COS-box of MIDI 



(respectively, 275-FLL-277 and 323-IDF-325 in MuRFl) 
were shown to independently abolish the interaction with 
microtubules [7]. Ab initio models in this study indicate 
that these motifs co-localize at the termini of the COS a- 
hairpin, at the base of the HD (figure 5a). This suggests 
that these motifs are important for the correct folding of 
the COS-box and /or that they form a key interaction locus 
in the fold. Furthermore, previous SPOT-blots identified a 
C-terminal sequence as the primary interaction site of 
MuRFl with M-line titin [20]. The mapping of this sequence 
on our secondary structure predictions (figure 2) and on 
the ah initio models (figure 5) shows that it corresponds to 
helix H3. This helix is in the outermost position in the 
models and thus is largely accessible. We conclude that 
our model of MuRFl COS-box rationalizes current binding 
data on this motif. Finally, we summarize the findings 
from this study in the proposal of an overall structural 
model of MuRFl (figure M) that might guide its further 
functional study. 



{a) (b) (c) 




Figure 5. Ab initio modelling of MuRFI COS-box. [a) Top Quark model spanning MuRFI plus the subsequent COS-box region coloured in a blue-to-red gradient. 
The model is superimposed on the crystal structure of MuRFI (grey). The pathogenic Q247^ mutation is shown in black and motifs previously identified to mediate 
microtubule binding in protein MIDI are in magenta [7]. Additional Quark models are shown in the electronic supplementary material, figure S2. (b) Rosetta- 
calculated model of the same segment derived from a cluster of 150 models. A comparison of Quark and Rosetta models is shown in the electronic supplementary 
material, figure S2. (c) Top Quark model of the full-length HD of MuRFI (the degree of bending of the long helix HI is not meaningful). 



5. Conclusion 

The change in demographics and inactive lifestyles is making 
muscle loss an endemic problem in the populations of devel- 
oped countries. Hence, the study of pathways that regulate 
the degradation of muscle proteins is of high interest. Over 
50 potential MuRFI targets have been proposed based on 
yeast two hybrid screens, including sarcomeric proteins of 
the Z-disc and M-lines [20,21], and the neuromuscular junction 
[37]. Our findings suggest that the COS-box is a region of high 
structural and functional importance in MuRFI. It is a necess- 
ary interaction motif that mediates MuRFI recruitment to 
myocellular structures, and thus the pharmacological pertur- 
bation of its targeting might open new avenues for the 
control of MuRFI -mediated atrophy of the myofibril. 



protease cleavage site N-terminal to the target construct. 
MuRFl^^ (containing the mutation C173S) was cloned into 
pETM-20 (EMBL collection) using the restriction sites Ncol 
and BamHI, fusing an N-terminal His6-TRX tag to the target 
construct, cleavable by TEV protease. MuRFl^^^ was cloned 
into the Ncol and Acc65I sites of pETZZ (http://babel.ucmp. 
umu.se/ cpep/ web_content/ Pages/ CPEP_09_vectors.html) to 
produce a TEV-protease-cleavable N-terminal Hise-ZZtag con- 
struct. The expression clone of MuRFl^^'^^'^^^ (containing 
the mutation C298S) has been previously reported [29]. 

The C-terminal fusion of the COS-box to GFP was 
obtained by cloning MuRFl^^^ into the pEGFP-Nl vector 
using Xhol and EcoRI sites. 

All constructs were verified by sequencing (Geneservice). 



6. Methods 

6.1. Sequence analysis 

Sequences of MuRF proteins were obtained from the Uni- 
protKB database and aligned with Clustalw2 [43] using the 
BLOSUM matrix. Secondary structure predictions of the heli- 
cal regions of MuRFI, -2, -3 used Jpred3 [44]. The probability 
for CC formation was calculated with MultiCoil [45] using a 
sequence window of 28 residues. 

6.2. Cloning 

Human MuRFl^^ (UniProtKB Q969Q1) was cloned into the 
vector pETM-11 (EMBL collection) using Kpnl and Ncol restric- 
tion sites. This vector incorporates a Hise-tag and a TEV 



6.3. Protein production 

MuRFl samples were overexpressed in Escherichia coli Rosetta2 
(DE3) or BL21 (DE3) (Novagen). Cultures were grown at 30°C 
up to an ODeoo of 0.6 in Luria Bertani medium supplemented 
with 20 |JLg ml"^ chloramphenicol and 30 |ULg ml~^ kanamycin. 
Expression was induced by 0.75 mM isopropyl-p-D-thiogalac- 
topyranoside (IPTG) and cultures were grown for a further 
20 h at 20°C. Cells were harvested by centrifugation at 4°C. 
Bacterial pellet was resuspended in lysis buffer (50 mM Tris 
pH 8.0, 50 mM NaCl, 0.5 mM p-mercaptoethanol) containing 
a protease inhibitor cocktail (Roche). Lysis was carried out 
by French pressing in the presence of DNAse. The homogenate 
was clarified by centrifugation and the supernatant applied to 
a Ni^^-chelating His trap column (GE Healthcare) equilibrated 
in lysis buffer containing 20 mM imidazol. Elution used 
200 mM imidazol. Tag removal was by incubation with TEV 
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Figure 6. MuRFI COS-box targeting of myocellular structures in vivo, {a-c) Localization of endogenous MuRF1 within sarcomeres. Muscle tissues were dissected 
from mice 5 days post-denervation, single myofibrils prepared from gastrocnemius and immunostained for (a) MuRF1 or [b] desmin. A merge of {a) and {b) 
indicates that endogenous MuRFI under muscle stress targets predominantly (c) the Z-disc region and to a lesser extent the M-line. {d-g) In vivo targeting of 
transiently expressed MuRFI COS-GFP fusion protein in transfected skeletal muscle. Depicted is {d,e) the enrichment of COS-GFP in sarcomeric striations as well as 
(f,g) an enrichment in puncta containing endocytic AChR ((f) a-bungarotoxin staining; (g) COS-GFP; arrowheads mark COS puncta that are also positive for AChR). 
A negative control corresponding to transfected GFP alone is shown in the electronic supplementary material, figure S6. (/?) After fixation and staining with phalloidin- 
TRITC against f-actin, longitudinal sections of the muscles depicted in (d-g) show enrichment of COS-GFP in the Z-line/l-band, mimicking the distribution of endogenous 
MuRFI. Faint immunopositive signal on the M-line (a) is also present with COS-GFP in vivo (e) but does not stand fixation and staining (/?). 



protease overnight at 4°C during dialysis against lysis buffer. 
The protein was then subjected to subtractive metal affinity 
chromatography followed by gel filtration on a Superdex 75 
16/60 HL column or on a Superdex 200 26/60 HL column 
equilibrated in 20 mM Tris pH 7.5, 200 mM NaCl (GE Health- 
care). The purified sample was stored at 4°C until further use. 
The production of MuRFl^^'^^'^^^ was as reported [29]. 

6.4. Crystal structure elucidation 

Crystals were grown at 20°C in hanging drops using 48-well 
plates (Hampton Research). Drops consisted of 1 |jl1 protein 



solution at 14 mg ml ^ and I |xl mother liquor containing 
35% MPD, 0.1 M sodium acetate pH 4.5, 20% [v/v] glycerol 
and 20 mM NaF. Crystals grew as thin plates with dimen- 
sions of 400 X 100 |JL^ in the measurable plane of the plate. 
For X-ray data collection crystals were flash frozen in liquid 
nitrogen. X-ray diffraction data were processed with XDS/ 
XSCALE [46] (table 1). Crystals contained four MuRFl^^ 
chains in the asymmetric unit, corresponding to a solvent 
content of 46%. Analysis with POLARRFN revealed a two- 
fold (k = 180°) non-crystallographic axis contained within 
the crystallographic ac plane {co = 90°, (p = 153°). Phases 
were obtained using ARCIMBOLDO [47]. Subsequent 



model building used COOT [48] and ARP/wARP [49], while 
refinement was in PHENIX [50]. Building of solvent and 
ordered buffer components used PHENIX and COOT. 

6.5. Multi-angle laser light scattering 

Measurements were performed on a Dionex BioLC HPLC 
connected to an 18-angle light-scattering detector and a differ- 
ential refractometer (DAWN HELEOS-II and Optilab rEX, 
Wyatt). A Superdex 75 10/300 GL column (GE Healthcare) 
was used in 20 mM Tris pH 8.0, 50 mM NaCl at a flow rate 
of 0.75 ml min~^. Sample volumes of 1 ml were injected at a 
concentration of 1.5mgml~^. Samples eluting from the 
column passed through an in-line DAWN HELEOS-II laser 
photometer (A = 658 nm) and an Optilab rEX refractometer 
with a QELS d3mamic light-scattering attachment. Light- 
scattering intensity and eluent refractive index (concentration) 
were analysed using ASTRA v. 5.3.4.13 software to give a 
weight-averaged MM. To determine the detector delay 
volumes and normalization coefficients for the MALLS 
detector, a BSA sample (Sigma A-8531) was used as reference. 

The SEC-MALLS analysis of MuRFl^^'^^'^^^ samples 
was carried out as above but used a Superdex 200 10/300 
prep-grade column equilibrated in 20 mM Tris pH 8.0, 
100 mM NaCl. MuRFl^^'^^'^^^ was injected at 0.7 mg ml" \ 

6.6. Transmission electron microscopy and image 
processing 

Aliquots of 5 |jl1 sample were adsorbed onto a glow- 
discharged carbon film-coated copper grid, washed with 
three droplets of pure water and subsequently stained with 
2% uranyl-acetate. Images were recorded using a Philips 
CMIO TEM (The Netherlands) operating at 80 kV on a 
Veleta 4 k CCD camera (Olympus, Germany). 

Reference-free alignment was performed on manually 
selected particles from electron micrographs using the 
EM AN image-processing package [51]. Next, particle projec- 
tions were classified by multi-variant statistical analysis. The 
class averages with the best signal-to-noise ratio were selected 
and gathered in a gallery. 

6.7. Ab initio modelling 

The MuRFl sequence corresponding to residues 214-327 was 
submitted to the Quark ab initio modelling server [34]. For com- 
parison, fragment assembly-based ah initio modelling was done 



with RosETTA using default parameters {ab initio protocol) to pro- 
duce 1000 models [35]. This was done both with and without 
specifying that residues 214-271 must adopt the experimentally 
determined helical structure (flag fix_residues_to_). 

6.8. In vivo transfection, staining and imaging 

For the analysis of endogenous MuRFl distribution in TA, its 
expression was induced by two weeks of N. ischiadicus 
denervation. Single myofibrils from TA were prepared as 
before [52], and endogenous MuRFl was detected with 
three different polyclonal antibodies [21,37] (available from 
www.myomedix.com). Double labelling with desmin, 
coupled to AlexaFluor647, was used to determine Z-disc 
localized MuRFl epitopes. All staining was done using stan- 
dard protocols as previously described [53]. 

Expression of MuRFl fused C-terminally to EGFP was 
by transfection of the expression vector into TA muscles, as pre- 
viously described [54]. Ten days post-transfection, mice were 
anaesthetized, transfected muscles exposed and injected with 
the marker for acetylcholine receptors, a-bungarotoxin-Alexa- 
Fluor647, as previously described [37]. Mice were then 
transferred to a confocal microscope (DMRE TCS SP2, Leica 
Microsystems) and GFP fluorescence excited using a KrAr 
laser (488 nm). Emission was detected by a 63x/1.2NA HCX 
PL APO CS W CORR objective (Leica Microsystems) (immer- 
sion medium Visc-Ophtal gel, Winzer-Pharma) using 500- 
550 nm bandpass. Next, muscles were extracted and fixed in 
4% PFA/PBS overnight at 4°C, washed in PBS for 30 min 
and embedded in 2% agarose. Longitudinal slices of 50 |jim 
thickness were made using a Leica vibratome VTIOOO S, permea- 
bilized for 4-5 h in 0.1% Triton X-100 and washed in PBS. 
Sarcomeric actin was labelled with 250 nM phalloidin-TRITC 
(Life Technologies) in 2% BSA /PBS overnight; slices were then 
washed in 2% BSA/PBS for 1-2 h and embedded in Mowiol. 
GFP, phalloidin-TRITC and a-bungarotoxin-AlexaFluor647 
were excited at 488, 561 and 633 nm, respectively. Emission 
was detected at 500-550, 570-620 and 650-750 nm bandpass. 
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